Eagle: A Better Hybrid Data Center Scheduler
نویسندگان
چکیده
Eagle is a new hybrid data center scheduler that considerably improves the job completion times for short jobs. Eagle builds on the Hawk hybrid scheduler, using a centralized scheduler for long jobs and distributed schedulers for short jobs. The main innovation in Eagle is that it provides an approximate and potentially slightly outof-date summary of the centralized scheduler state to the distributed schedulers to guide them in their scheduling choices. This approach leads to much better job completion times for short jobs, compared to the random probing and work stealing used in Sparrow and Hawk. The challenge is how to provide this information to the distributed schedulers without negatively impacting the performance of the centralized scheduler, and therefore the job completion times of the long jobs. The approximate nature of the information provided to the distributed schedulers is the key to an efficient implementation. The information is piggybacked on task assignment messages by the centralized scheduler, and eventually forwarded to the distributed schedulers. In particular, our implementation does not require the centralized scheduler to send any additional message, and only requires a very modest amount of computation and additional data to be sent. We evaluate Hawk by trace-driven simulation and a prototype implementation in Spark. Using the popular Google, Yahoo and Cloudera production traces, Eagle improves 50th, 90th and 99th percentiles of the short jobs completion times distribution by a factor of 1.72 on average, and up to 8.33, with a negligible impact on long jobs completion times (< 1% slowdown on average).
منابع مشابه
Eagle job-aware scheduling: divide and ... reorder
We present Eagle, a new hybrid cluster scheduler for data-parallel programs, consisting of a centralized scheduler for long jobs and a set of distributed schedulers for short jobs. Eagle incorporates two new techniques: succinct state sharing and sticky batch probing. With succinct state sharing, the centralized scheduler informs the distributed schedulers of the placement of long jobs in a low...
متن کاملA neural network job-shop scheduler
This paper focuses on the development of a neural network (NN) scheduler for scheduling job-shops. In this hybrid intelligent system, genetic algorithms (GA) are used to generate optimal schedules to a known benchmark problem. In each optimal solution, every individually scheduled operation of a job is treated as a decision which contains knowledge. Each decision is modeled as a function of a s...
متن کاملHardware-software architecture for priority queue management in real-time and embedded systems
The use of hardware-based data structures for accelerating real-time and embedded system applications is limited by the scarceness of hardware resources. Being limited by the silicon area available, hardware data structures cannot scale in size as easily as their software counterparts. We assert a hardware-software co-design approach is required to elegantly overcome these limitations. In this ...
متن کاملThe Estuarine and Great Lakes (EaGLe) Coastal Indicators Program: Centers Developing the Next Generation of Indicators
respectively), it has not been subjected to the Agency's required peer and policy review and therefore does not necessarily reflect the views of the Agency and no official endorsement should be inferred. Executive Summary The Estuarine and Great Lakes (EaGLe) program was established to develop a suite of new, integrative indicators of ecological condition for the coastal zones of the contiguous...
متن کاملHawk: Hybrid Datacenter Scheduling
This paper addresses the problem of efficient scheduling of large clusters under high load and heterogeneous workloads. A heterogeneousworkload typically consists of many short jobs and a small number of large jobs that consume the bulk of the cluster’s resources. Recent work advocates distributed scheduling to overcome the limitations of centralized schedulers for large clusters with many comp...
متن کامل